CRF Errors Corrected by the STIC Systems Branch

□ Changed a file from non-ASCII to ASCII. Verified by: [illegible] (STIC staff)

□ Changed the margins in cases where the sequence text was "wrapped" down to the next line.
□ Edited a format error in the Current Application Data section, specifically:






□ Edited the Current Application Data section with the actual current number. The number input by the
applicant was □ the prior application data; or □ other.

□ Added the mandatory heading and subheadings for "Current Application Data".

□ Edited the "Number of Sequences" field. The applicant spelled out a number instead of using an integer.

□ Changed the spelling of a mandatory field (the headings or subheadings), specifically:

□ Corrected the SEQ ID NO when obviously incorrect. The sequence numbers that were edited were:

□ Inserted or corrected a nucleic number at the end of a nucleic line. SEQ ID NO's edited:

□ Corrected subheading placement. All responses must be on the same line as each subheading. If the
applicant placed a response below the subheading, this was moved to its appropriate place.

□ Inserted colons after headings/subheadings. Headings edited included:

☒ Deleted extra, invalid, headings used by an applicant, specifically:

APPLICANT

□ Deleted: □ non-ASCII "garbage" at the beginning/end of files; □ secretary initials/filename at end of file;

□ page numbers throughout text; □ other invalid text, such as . 

□ Inserted mandatory headings, specifically:



□ Corrected an obvious error in the response, specifically:



□ Edited identifiers where upper case is used but lower case is required, or vice versa.

□ Corrected an error in the Number of Sequences field, specifically:



□ A "Hard Page Break" code was inserted by the applicant. All occurrences had to be deleted.

□ 



□ Deleted ending stop codon in amino acid sequences and adjusted the "(A)Length:" field accordingly (error
due to a PatentIn bug). Sequences corrected:



Other: (B) Filing date [remainder illegible]



Examiner: The above corrections must be communicated to the applicant in the first Office
Action. DO NOT send a copy of this form. 3/1/95
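Several of the corrections listed on this form are mechanical text clean-ups. The following is a minimal Python sketch of three of them (dropping non-ASCII characters, deleting hard page breaks, and inserting missing colons after headings). It is not the STIC tooling: the function name `clean_crf_text`, the short `HEADINGS` list, and the choice of the form-feed character as the stored form of a "Hard Page Break" are all assumptions made for illustration.

```python
import re

# Hypothetical subset of headings, taken from the listing in this document.
# The full set of mandatory headings is not reproduced here.
HEADINGS = (
    "APPLICANT",
    "TITLE OF THE INVENTION",
    "NUMBER OF SEQUENCES",
    "CURRENT APPLICATION DATA",
)


def clean_crf_text(raw: bytes) -> str:
    """Apply three of the checklist corrections to the raw bytes of a listing."""
    # Decode byte-for-byte, then keep only 7-bit ASCII characters.
    text = raw.decode("latin-1")
    text = "".join(ch for ch in text if ord(ch) < 128)
    # Assumption: a "Hard Page Break" is stored as a form feed (0x0C).
    text = text.replace("\f", "")
    # Insert a colon after a known heading when none follows it directly.
    pattern = r"\b(" + "|".join(re.escape(h) for h in HEADINGS) + r")\b(?!:)"
    return re.sub(pattern, r"\1:", text)
```

For example, `clean_crf_text(b"(i) APPLICANT\n")` returns the same line with a colon appended to the heading, while a heading that already carries its colon is left unchanged.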



TEAM 8

PAGE: 1 RAW SEQUENCE LISTING DATE: 02/21/97

PATENT APPLICATION US/08/785,455 TIME: 15:30:00

INPUT SET: S15668.raw



This Raw Listing contains the General 
Information Section and up to the first 5 pages. 



1 SEQUENCE LISTING 

3 (1) General Information

> 5 (i) APPLICANT

> 6 (A) NAME: Hodgson, John

> 7 (B) STREET: Great West Road

> 8 (C) CITY: Brentford

> 9 (D) STATE OR PROVINCE: Middlesex 

> 10 (E) COUNTRY: United Kingdom 

> 11 (F) POSTAL CODE: TW8 9EP 

12

13 APPLICANT

> 14 (A) NAME: Hodgson, John 

> 15 (B) STREET: Great West Road 

> 16 (C) CITY: Brentford 

> 17 (D) STATE OR PROVINCE: Middlesex 

> 18 (E) COUNTRY: United Kingdom 

> 19 (F) POSTAL CODE: TW8 9EP 
20 

21 (ii) TITLE OF THE INVENTION: Novel tRNA Synthetase 

22 

23 

24 (iii) NUMBER OF SEQUENCES: 2 
25 

26 (iv) COMPUTER-READABLE FORM: 

27 (A) MEDIUM TYPE: Diskette 

28 (B) COMPUTER: IBM Compatible 

29 (C) OPERATING SYSTEM: DOS 

30 (D) SOFTWARE: FastSEQ for Windows Version 2.0 
31 

32 (v) CURRENT APPLICATION DATA: 

33 (A) APPLICATION NUMBER:

34 (B) FILING DATE:
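The "NUMBER OF SEQUENCES" field in the General Information section can be cross-checked against the sequence headings that follow. The sketch below is one possible way to do that in Python, assuming the heading wording used in this listing. The function names and the small `WORDS` table (for a number that the applicant spelled out instead of writing as an integer) are hypothetical.

```python
import re

# Hypothetical lookup for spelled-out numbers; extend as needed.
WORDS = {
    "one": 1, "two": 2, "three": 3, "four": 4, "five": 5,
    "six": 6, "seven": 7, "eight": 8, "nine": 9, "ten": 10,
}


def declared_sequence_count(listing: str) -> int:
    """Read the NUMBER OF SEQUENCES field, accepting digits or a spelled-out word."""
    match = re.search(r"NUMBER OF SEQUENCES:\s*(\w+)", listing)
    if match is None:
        raise ValueError("NUMBER OF SEQUENCES field not found")
    value = match.group(1).lower()
    return int(value) if value.isdigit() else WORDS[value]


def counted_sequences(listing: str) -> int:
    """Count the 'INFORMATION FOR SEQ ID NO: n:' headings, tolerating stray spaces."""
    return len(re.findall(r"INFORMATION FOR SEQ ID NO\s*:\s*\d+\s*:", listing))
```

A listing is consistent in this respect when `declared_sequence_count(text) == counted_sequences(text)`.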

35 (2) INFORMATION FOR SEQ ID NO: 1:
36 

37 (i) SEQUENCE CHARACTERISTICS: 

38 (A) LENGTH: 1974 base pairs 

39 (B) TYPE: nucleic acid 

40 (C) STRANDEDNESS: double

41 (D) TOPOLOGY: linear
42

43 (ii) MOLECULE TYPE: Genomic DNA
44

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
46 



PAGE: 2 RAW SEQUENCE LISTING DATE: 02/21/97

PATENT APPLICATION US/08/785,455 TIME: 15:30:03

INPUT SET: S15668.raw 

47 ATGGCTAAAG AAACATTTTA TATAACAACC CCAATATACT ATCCTAGTGG GAATTTACAT 60 

48 ATAGGACATG CATATTCTAC AGTGGCTGGA GATGTTATTG CAAGATATAA GAGAATGCAA 120 

49 GGATATGATG TTCGTTATTT GACTGGAACG GATGAACACG GTCAAAAAAT TCAAGAAAAA 180 

50 GCTCAAAAAG CTGGTAAGAC AGAAATTGAA TATTTGGATG AGATGATTGC TGGAATTAAA 240 

51 CAATTGTGGG CTAAGCTTGA AATTTCAAAT GATGATTTTA TCAGAACAAC TGAAGAACGT 300 

52 CATAAACATG TCGTTGAGCA AGTGTTTGAA CGTTTATTAA AGCAAGGTGA TATCTATTTA 360 

53 GGTGAATATG AAGGTTGGTA TTCTGTTCCG GATGAAACAT ACTATACAGA GTCACAATTA 420 

54 GTAGACCCAC AATACGAAAA CGGTAAAATT ATTGGTGGCA AAAGTCCAGA TTCTGGACAC 480 

55 GAAGTTGAAC TAGTTAAAGA AGAAAGTTAT TTCTTTAATA TTAGTAAATA TACAGACCGT 540 

56 TTATTAGAGT TCTATGACCA AAATCCAGAT TTTATACAAC CACCATCAAG AAAAAATGAA 600

57 ATGATTAACA ACTTCATTAA ACCAGGACTT GCTGATTTAG CTGTTTCTCG TACATCATTT 660 

58 AACTGGGGTG TCCATGTTCC GTCTAATCCA AAACATGTTG TTTATGTTTG GATTGATGCG 720 

59 TTAGTTAACT ATATTTCAGC ATTAGGCTAT TTATCAGATG ATGAGTCACT ATTTAACAAA 780 

60 TACTGGCCAG CAGATATTCA TTTAATGGCT AAGGAAATTG TGCGATTCCA CTCAATTATT 840 

61 TGGCCTATTT TATTGATGGC ATTAGACTTA CCGTTACCTA AAAAAGTCTT TGCACATGGT 900 

62 TGGATTTTGA TGAAAGATGG AAAAATGAGT AAATCTAAAG GTAATGTTGT AGACCCTAAT 960 

63 ATTTTAATTG ATCGCTATGG TTTAGATGCT ACACGTTATT ATCTAATGCG TGAATTACCA 1020 

64 TTTGGTTCAG ATGGCGTATT TACACCTGAA GCATTTGTTG AGCGTACAAA TTTCGATCTA 1080 

65 GCAAATGACT TAGGTAACTT AGTAAACCGT ACGATTTCTA TGGTTAATAA GTACTTTGAT 1140 

66 GGCGAATTAC CAGCGTATCA AGGTCCACTT CATGAATTAG ATGAAGAAAT GGAAGCTATG 1200 

67 GCTTTAGAAA CAGTGAAAAG CTACACTGAA AGCATGGAAA GTTTGCAATT TTCTGTGGCA 1260 

68 TTATCTACGG TATGGAAGTT TATAAGTAGA ACGAATAAGT ATATTGACGA AACAACGCCT 1320 

69 TGGGTATTAG CTAAGGACGA TAGCCAAAAA GATATGTTAG GCAATGTAAT GGCTCACTTA 1380 

70 GTTGAAAATA TTCGTTATGC AGCTGTATTA TTACGTCCAT TCTTAACACA TGCGCCGAAA 1440 

71 GAGATTTTTG AACAATTGAA CATAAACAAT CCTCAATTTA TGGAATTTAG TAGTTTAGAG 1500 

72 CAATATGGTG TGCTTACTGA GTCAATTATG GTTACTGGGC AACCTAAACC TATTTTCCCA 1560 

73 AGATTGGATA GCGAAGCGGA AATTGCATAT ATCAAAGAAT CAATGCAACC GCCTGCTACT 1620 

74 GAAGAGGAAA AAGAAGAGAT TCCTAGCAAA CCTCAAATTG ATATTAAAGA CTTTGATAAA 1680 

75 GTTGAAATTA AGGCAGCAAC GATTATTGAT GCTGAACATG TTAAGAAGTC AGATAAGCTT 1740 

76 TTAAAAATTC AAGTAGACTT AGATTCTGAA CAAAGACAAA TTGTATCAGG AATTGCCAAA 1800 

77 TTCTATACAC CAGATGATAT TATTGGTAAA AAAGTAGCAG TTGTTACTAA CCTGAAACCA 1860 

78 GCTAAATTAA TGGGACAAAA ATCTGAAGGT ATGATATTAT CTGCTGAAAA AGATGGTGTA 1920 
79 TTAACCTTAG TAAGTTTACC AAGTGCAATT CCAAATGGTG CAGTGATTAA ATAA 1974
80 
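Each nucleic line above ends with the running total of bases, which is the number the checklist item on inserting or correcting "a nucleic number at the end of a nucleic line" refers to. A minimal Python sketch of that check follows. It assumes each input line holds only the base groups and the trailing number, with the raw-listing line number already removed, and the function name `check_running_counts` is hypothetical.

```python
def check_running_counts(lines):
    """Return the lines whose trailing number differs from the bases counted so far."""
    total = 0
    mismatched = []
    for line in lines:
        if not line.strip():
            continue  # skip blank separator lines
        *groups, stated = line.split()
        total += sum(len(group) for group in groups)
        if int(stated) != total:
            mismatched.append(line)
    return mismatched
```

An empty result means every stated count agrees with the bases actually present on the lines up to that point.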

81 (2) INFORMATION FOR SEQ ID NO: 2: 

82 



88 

89 (ii) MOLECULE TYPE: peptide 

90 

91 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

92 

93 Met Ala Lys Glu Thr Phe Tyr Ile Thr Thr Pro Ile Tyr Tyr Pro Ser

94 1 5 10 15

95 Gly Asn Leu His Ile Gly His Ala Tyr Ser Thr Val Ala Gly Asp Val

96 20 25 30

97 Ile Ala Arg Tyr Lys Arg Met Gln Gly Tyr Asp Val Arg Tyr Leu Thr

98 35 40 45

99 Gly Thr Asp Glu His Gly Gln Lys Ile Gln Glu Lys Ala Gln Lys Ala



PAGE: 3 RAW SEQUENCE LISTING DATE: 02/21/97

PATENT APPLICATION US/08/785,455 TIME: 15:30:07

INPUT SET: S15668.raw

100 50 55 60

101 Gly Lys Thr Glu Ile Glu Tyr Leu Asp Glu Met Ile Ala Gly Ile Lys

102 65 70 75 80

103 Gln Leu Trp Ala Lys Leu Glu Ile Ser Asn Asp Asp Phe Ile Arg Thr

104 85 90 95

105 Thr Glu Glu Arg His Lys His Val Val Glu Gln Val Phe Glu Arg Leu

106 100 105 110

107 Leu Lys Gln Gly Asp Ile Tyr Leu Gly Glu Tyr Glu Gly Trp Tyr Ser

108 115 120 125

109 Val Pro Asp Glu Thr Tyr Tyr Thr Glu Ser Gln Leu Val Asp Pro Gln

110 130 135 140

111 Tyr Glu Asn Gly Lys Ile Ile Gly Gly Lys Ser Pro Asp Ser Gly His

112 145 150 155 160

113 Glu Val Glu Leu Val Lys Glu Glu Ser Tyr Phe Phe Asn Ile Ser Lys

114 165 170 175

115 Tyr Thr Asp Arg Leu Leu Glu Phe Tyr Asp Gln Asn Pro Asp Phe Ile

116 180 185 190

117 Gln Pro Pro Ser Arg Lys Asn Glu Met Ile Asn Asn Phe Ile Lys Pro

118 195 200 205

119 Gly Leu Ala Asp Leu Ala Val Ser Arg Thr Ser Phe Asn Trp Gly Val

120 210 215 220

121 His Val Pro Ser Asn Pro Lys His Val Val Tyr Val Trp Ile Asp Ala

122 225 230 235 240

123 Leu Val Asn Tyr Ile Ser Ala Leu Gly Tyr Leu Ser Asp Asp Glu Ser

124 245 250 255

125 Leu Phe Asn Lys Tyr Trp Pro Ala Asp Ile His Leu Met Ala Lys Glu

126 260 265 270

127 Ile Val Arg Phe His Ser Ile Ile Trp Pro Ile Leu Leu Met Ala Leu

128 275 280 285

129 Asp Leu Pro Leu Pro Lys Lys Val Phe Ala His Gly Trp Ile Leu Met

130 290 295 300

131 Lys Asp Gly Lys Met Ser Lys Ser Lys Gly Asn Val Val Asp Pro Asn

132 305 310 315 320

133 Ile Leu Ile Asp Arg Tyr Gly Leu Asp Ala Thr Arg Tyr Tyr Leu Met

134 325 330 335

135 Arg Glu Leu Pro Phe Gly Ser Asp Gly Val Phe Thr Pro Glu Ala Phe

136 340 345 350

137 Val Glu Arg Thr Asn Phe Asp Leu Ala Asn Asp Leu Gly Asn Leu Val

138 355 360 365

139 Asn Arg Thr Ile Ser Met Val Asn Lys Tyr Phe Asp Gly Glu Leu Pro

140 370 375 380

141 Ala Tyr Gln Gly Pro Leu His Glu Leu Asp Glu Glu Met Glu Ala Met

142 385 390 395 400

143 Ala Leu Glu Thr Val Lys Ser Tyr Thr Glu Ser Met Glu Ser Leu Gln

144 405 410 415

145 Phe Ser Val Ala Leu Ser Thr Val Trp Lys Phe Ile Ser Arg Thr Asn

146 420 425 430

147 Lys Tyr Ile Asp Glu Thr Thr Pro Trp Val Leu Ala Lys Asp Asp Ser

148 435 440 445

149 Gln Lys Asp Met Leu Gly Asn Val Met Ala His Leu Val Glu Asn Ile

150 450 455 460

151 Arg Tyr Ala Ala Val Leu Leu Arg Pro Phe Leu Thr His Ala Pro Lys

152 465 470 475 480

PAGE: 4 RAW SEQUENCE LISTING DATE: 02/21/97

PATENT APPLICATION US/08/785,455 TIME: 15:30:11

INPUT SET: S15668.raw

153 Glu Ile Phe Glu Gln Leu Asn Ile Asn Asn Pro Gln Phe Met Glu Phe

154 485 490 495

155 Ser Ser Leu Glu Gln Tyr Gly Val Leu Thr Glu Ser Ile Met Val Thr

156 500 505 510

157 Gly Gln Pro Lys Pro Ile Phe Pro Arg Leu Asp Ser Glu Ala Glu Ile

158 515 520 525

159 Ala Tyr Ile Lys Glu Ser Met Gln Pro Pro Ala Thr Glu Glu Glu Lys

160 530 535 540

161 Glu Glu Ile Pro Ser Lys Pro Gln Ile Asp Ile Lys Asp Phe Asp Lys

162 545 550 555 560

163 Val Glu Ile Lys Ala Ala Thr Ile Ile Asp Ala Glu His Val Lys Lys

164 565 570 575

165 Ser Asp Lys Leu Leu Lys Ile Gln Val Asp Leu Asp Ser Glu Gln Arg

166 580 585 590

167 Gln Ile Val Ser Gly Ile Ala Lys Phe Tyr Thr Pro Asp Asp Ile Ile

168 595 600 605

169 Gly Lys Lys Val Ala Val Val Thr Asn Leu Lys Pro Ala Lys Leu Met

170 610 615 620

171 Gly Gln Lys Ser Glu Gly Met Ile Leu Ser Ala Glu Lys Asp Gly Val

172 625 630 635 640

173 Leu Thr Leu Val Ser Leu Pro Ser Ala Ile Pro Asn Gly Ala Val Ile

174 645 650 655

175 Lys
176 

177 
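The peptide of SEQ ID NO: 2 reads as the translation of the 1974-base sequence of SEQ ID NO: 1: 1974 bases give 658 codons, the last of which is the stop codon TAA, leaving 657 residues ending in Lys. The Python sketch below shows how such a translation could be reproduced. It assumes the standard genetic code and reading frame 1, and the names `translate`, `CODON_TABLE` and `THREE_LETTER` are hypothetical helpers, not part of any tool named in this document.

```python
BASES = "TCAG"
# Standard genetic code, with codons ordered by first, second, third base over T, C, A, G.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(dna):
    """Translate DNA in frame 1 into three-letter residues, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino = CODON_TABLE[dna[pos:pos + 3]]
        if amino == "*":
            break  # the stop codon is not listed as a residue
        residues.append(THREE_LETTER[amino])
    return residues
```

Comparing `translate(...)` of the joined nucleic lines against the residues listed for SEQ ID NO: 2 is one way to confirm that no stop codon is carried into the amino acid sequence and that the stated length matches.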



